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Abstract 

The paper considers two-phase random design hnear regression models. The errors 
and the regressors are stationary long-range dependent Gaussian. The regression 
parameters, the scale parameters and the change-point are estimated using a method 
introduced by Rousseeuw and Yohai [33]. This is called S-estimator and it has the 
property that is more robust than the classical estimators; the outliers don't spoil 
the estimation results. Some asymptotic results, including the strong consistency 
and the convergence rate of the S-estimators, are proved. 

Key words: Change-points, S-estimator, Long-memory, Asymptotic properties 
AMS 2000 subject classifications: primary 62F12; secondary 62H12, 60G15. 



1 Introduction 

Consider the two-phase linear regression model: 

Yt = Xtf3lTLl<t<[mr] + Xt(32'^[mT]+l<t<n + ^i, t = 1, 77, (l) 

where ]1(.) is the indicator function and vr G (0, 1), ^ = /32, vr), ^1,^2 G T. The 
set T is a compact of W^, d > 1. For this model, Yt denotes the response variable, 
Xf is a p-vector of regressors and et is the error. 
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The model parameters are: regression parameters (3i and (32, change-point tt and 
error variance o"^, with G (0, oo). Let us denote = /J^, tt") and Uq the true 
values of these parameters. In this paper we consider the problem of estimating of 
^ and 0"^, based on the observation of (Yt, Xt)i<t<n- 

Classical estimation methods studied in the statistic literature are the least squares 
(LS), maximum likelihood (ML) or a wider class M-estimation methods. For each 
of these methods one has to distinguish the cases when the errors are independent 
or not, and in the dependent case it is necessary to take into account the covariance 
structure. The same conditions can be considered for regressors Xt. In traditional 
methodology, these variables are usually assumed to be independent or with short- 
memory. So, if the errors are i.i.d. or with short-memory, the statistic literature 
related to the parametric change-point estimation is very vast. Recent develop- 
ments for the LS estimation include Feder (p3], [Hj), Bai and Perron p], Kim 
and Kim [22]. Bai [1] considers also the least squares estimation of a shift in lin- 
ear process. The process et is given by: et = J2'^o^j'^k-j, where Uj is white noise 
with mean zero and variance and the coefficients Cj satisfy I^^oikjl < 
This condition excludes long-memory. For the ML estimation we refer to Bhat- 
tacharya |7j, Koul and Qian |23j, Ciuperca and Dapzol [H]. In the general case of 
the M-estimator, we can cite the papers of Rukhin and Vajda [3l], Koul et al. p5]. 
Obviously, the list is not exhaustive, the subject is so large and productive that 
we cannot give all the papers. The convergence rate and limiting distributions of 
the change-point and of the regression parameters M-estimators are derived for 
the model ([T]) by Fiteni [T3] , under restrictive and numerous assumptions. Among 
these conditions she considers that {Yt,Xt) is a random vector, Lq-NED, on a 
strong mixing base {wt;t = ...,0,1,...}, p'{et + 6Xt)Xt is a random sequence of 
mean zero, L2-NED of size 1/2 on a strong mixing base {wt'-,t = ...,0, 1, ...} and 
snp^^^ IE [\\p'{et + 9Xt)Xt\\^ for some r > 2. Under the same dependence assump- 
tions, Fiteni [TU] considers the r-estimators. 

On the contrary, in the case of long-memory errors or regressors, the statistical 
literature related to the parametric change-point estimation is less vast. For the 
simpler model: 

Yt = fili<t<k* + ifJ' + S)lk*<t<n + £t (2) 

when the errors 6t are long-memory Gaussian, Horvath and Kokoszka [20] consid- 
ered the estimator of k* defined hj k = minjfc; \Uk\ = maxi<j<„ \Ui\}, where, for 
7 G [0, 1), f/j = Yl]=i{Xj — Xn). The estimator converges to functionals 

of fractional Brownian motion. For the same model. Hidalgo and Robinson [19j, 
Sibbertsen |l35J consider the LS estimator of k, fi and 6. A more complex model: 
Yt = fj, + {P + 61t<iTn])Xt + £t is considered in the paper of Lazarova [27], but with 
the supposition that r fixed. The limiting distribution of the LS estimator of the 
parameters (3 and 6 is given. 
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Concerning now the estimation method, it is well known that one outlier may 
cause a large error in a LS-estimator, ML-estimator or more generally in classical 
M-estimator. Nunes et al. ^29j, Kuan and Hsu observed that, for the data that 
have long-memory, the LS-estimator may suggest a spurious change-point when 
there is none. In that case, the parameters of the model can be estimated by using 
least absolute deviations (LAD) method. If the errors are independent, Bai [2] 
studies the LAD estimator for a multiple regime linear regression and Ciuperca 
[To] for a nonlinear change-point model. A more robust estimator was introduced 
by Rousseeuw and Yohai [33] , by defining the S-estimators as the minimizers of a 
M-estimator of the residual scale. The interest of the S-estimator in respect to the 
LAD-estimators, is the breakdown point introduced by Hampel [TH]. The break- 
down point amounts to determining the smallest contaminating mass that can 
cause the estimator to take on value arbitrary for from the true value. Instead, 
concerning this method, to the author's knowledge, the past papers treat only re- 
gression models without change-point. For a linear regression model, Davies [H] 
proves the consistency and weak convergence of S-estimator under the assump- 
tion that the errors are i.i.d. random variables. The asymptotic behaviour of the 
S-estimator in a linear regression, without change-point can be also found in the 
papers Zhengyan et al. [36], Roeland et al. [32] . 

In the present paper, we consider a linear regression model with a change-point 
in an unknown point. The regressors and the errors are assumed to be Gaussian 
vectors, and respectively variables, with long-memory. The regression parameters, 
the scale parameter and the change-point location are estimated by the S-method. 
The difficulty of study of the asymptotic properties of these S-estimators comes 
especially from the dependence on change-point in the expression of the scale pa- 
rameter estimators. We first prove that the estimators are strongly convergent and 
afterwards their convergence rates are obtained. These rates depend of covariance 
structure of Xt and et and of Hermite rank of p{et/cro) — IE[p{et/cro)], where p is 
the function used to construct the S-estimator. For the regression parameters and 
the scale parameter, we obtain the same convergence rate as in a model without 
change-point, let us denote it f„. The S-estimator of the change-point has a faster 
convergence rate, more precisely n~^f„. This result is totally different from those 
obtained in the other papers where the dependence between observations is con- 
sidered. Especially, let us notice that our change-point S-estimator converges more 
quickly towards true value than in the independence or the short-memory case or 
a classic estimation method. 

The plan of this paper is as follows. In Section 2 we make some notations and 
assumptions afterwards we define the S-estimator for a model with change-point. 
In Section 3, the asymptotic behaviour of these estimator is studied. The proofs 
of theorems are given in Section 4. Finally, Section 5 contains some lemmas which 
are useful to prove the main results. 
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2 Notation and assumptions 



Long-memory (long-range dependent) processes arise in numerous physical and 
social sciences. For several examples, see e.g. Baillie [Ij, Cheung [8], Lo [28] among 
others. We also mention Guo and Kuol p!7], where some currency exchange data 
sets with long-memory are considered. Another long-memory example in economy 
we find also in Ding et al. [12j on S&P daily 500 stock market returns. In that 
paper, they found that although the returns themselves contain little serial corre- 
lation, the absolute value of returns has significantly positive serial correlation up 
to 2700 lags. 

For the construction of the S-estimators, a function p : M — > [0, 1] is needed. 
Throughout our article, we assume that the following classic conditions are satisfied 
by p: 

• p is symmetric, continuously differentiable on M and p(0) = 0. 

• p is increasing in [0, c), for some c > 0, and constant in [c, oo). 
Let us denote: ip{z) = p'{z). 

An example of p satisfying these conditions was proposed by Beaton and Tukey 
[5], for some c > 0: 



For model ([T]), the following assumptions are considered: 

(Al) Xt is a sequence of (i-dimensional stationary long-range dependent Gaussian 
vectors, with E[Xt] = 0, covariance matrix T{t) = lE[XiXt+i] = C{t)^ N{t)C{t), 
where N{t) = diaglr^^ ...,t~^^), G (0,1) for t > 1 and 1(0) = Var{Xi). 

C{x) a d X d orthogonal matrix of slowly varying functions; 

(A2) Et a sequence of stationary long-range dependent Gaussian variables, with 
IE[et\ = 0, 7(0) = Kar[£:f] = ctq and the covariance 7(t) = lE[eiet+i] = t~"L{t), 
a G (0, 1) for t > 1. L{x) a positive slowly varying function; 
(A3) the errors are independent of Xt. 

The values of 6'i, 6^, a and the functions expressions of C{x), L{x) are known. 
Recall that a positive measurable function h is slowly varying in Karamata's sense 
if and only if, for any A > 0, h{Xx)/h{x) converges to 1 as x tends to infinity. 
Examples of slowly varying functions: logx, log log x, log log log a; 





1, 



if \x\ > c 



(3) 
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Interested readers are referred to Beran [H] or Robinson [3T] for a complete reference 
on long-memory processes. 

An example of process Xt = {Xa, Xtd) is obtained when, for some < < 1/2: 

d 

1=1 vdZ 

where Lji are slowly varying functions and where = (^^^i, <j^^rf)^, t> G Z are 
i.i.d. with j, j = 1, d standard Gaussian variables (see Koul and Baillie [23]). 
For the residual function, let us consider classical notation rt(/?) = Yt — XtP and 
let K the constant given by = ]E^[p{ei/ao)], where $ is standard Gaussian 
distribution. 

In order to construct the S-estimator in a change-point model ([T]), we proceed as 
follows: 

- first, for /52, ti") G T x T x (0, 1) fixed, scale parameter a is estimated by the 
positive solution ■s„(^) = /52, tt) of the equation: 



- at the second stage, the regression parameters are estimated by the argument of 
the minimum of solution obtained of the previous phase: 

(/3i„(7r),/32„(7r)) = arg rnin s„(/3i, /52, vr) (5) 

^ ^ (/3i,/32)eTxT 

- in the end, the change-point is estimated by: 



Tfn = arg min s„ (/3i„(7r), /52„(7r), tt (6) 

7rg[0,l] ^ ^ 

We shall make the usual identifiability assumption that the two segments are 
different: 

Pi^P2, V^gTxTx(0,1) (7) 

i.e. at least one of the coefficients of Xt has a shift. Thus the jump at vr is non-zero. 
This condition implies that the solution of ([6]) is unique and it will be essential in 
the proof of the strong consistency. 

If solution SniC) to dl]) exists then it is well-defined, bounded, strictly positive, with 
a probability arbitrarily large (see Lemma [5. ip . These results are valid regardless 
of the covariance structure of Xt, of et and their distribution. What matters is 
their average is worth and their variance is bounded. 

If has more than one solution, s„(.^) is defined as the supremum of all solutions. 
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Obviously, if function p is given by ([3]), thus equation (jlj) has at least a solution. 



In this context, we define (3"„ = s„ \j3in{T^n)/[^2n{T^n)^T^nj as the S-estimator of a 

and {(3in-,(^2n) = 0in{T^n)^J^2n{T^n)) that of (/?i,/?2)- We shall study the asymptotic 
behaviour of {Pin, P2n) and of 7r„, in the case that equation (jlj) has at least a 
solution. 



For any (p twice different iable function, for x, /i G M, throughout this paper we are 
going to use the mean value theorem under the form: 



ip{x + h) = ip{x) + h 



ip'{x) + h (1 - s)(f"{x + sh)ds 



(8) 



For a vector V = {vi, ■ ■ ■ ,Vm), let us denote by \\V\\ its Euclidean norm and we 



make the convention that \V\ = {\vi\ 



\Vm\)- 



In the following, we denote by C a generic positive finite constant that may be 
different in different context, but will never depend on n. 



3 Asymptotic behaviour 



This section establishes asymptotic properties of the S-estimator in model ([T]). For 
this purpose, first let us calculate, for solution Sn{0 of equation (jl]), the partial 
derivatives with respect to /3i and ^2- Differentiating (jlj) with respect to we 
obtain: 

^ r,(A)g.„(0 (n{Pi)\ " niP2)ds„iO /r,(/32)\ ^ X, /n(A)^ 
h SniO d(3, ^ \ SniO ; *=M+i ^niO dp, ^ 1, SniO ) h s^iO^ \ SniO J 

Considering the following notation: 

and by making similar calculation for 9s„(^)/9/52, we obtain: 

--n-^D.U)-^J:S^Hm) (10) 
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Since p is symmetric and increasing in [0,c)( and choosing suitably c) we have: 



xiIj{x) < 



> 0, if X G (-c,c) \ {0} 



0, if X = or |x| > c 



By means of Lemma [5.21 we prove that the random process Dn{C,)^^ is bounded 
with a probabihty close to 1. In fact, the covariance structure of Xt and of et, 
respectively, plays no role in this result. Moreover, if both random variables are 
no more Gaussian, Lemma 15.21 holds if Xf and et are bounded with a probability 
close to 1. 



In order to prove the consistency we require that function ip also is differentiable 
and strictly increasing on (0,c). This condition will be used for the Taylor's ex- 
pansion of p, around (/5°,/52), up to second order. 
(HI) ip{.) is differentiable and ^'{u) > 0, Vm G (0,c). 



Theorem 3.1 Under assumptions (A1)-(A3), (HI), we have that estimator 
hn.T^n) is strongly consistent: — ^ ■Co- 

Remark 3.1 Statement of Theorem \3. 1\ remains valid, if Xt is not Gaussian, hut 
it is i.i.d. and ]E[XtXj] < oo. If St is not Gaussian, it has to he hounded with a 
probahility close to 1. 

As a consequence of relation ffTUj) . the first two stages (jlj) and ^ in the construction 
of the parameters estimators, are the solutions to the equations system: 

(a) eH P (^) + n-^ EL^.j+i P (^) - ^ = 

(6) n-' eS ^ (^) Xt = (12) 

(c) ^-^ELm+i^(^)x, = o 

Since the change-point intervention is essential, the convergence study of the scale 
parameter estimator is realized separately. According to Theorem 13. we fix vr 
in a neighbourhood V(7r°) of vr''. In order to show the convergence of the scale 
parameter estimator, supplementary assumptions are needed. 



(H2) ip is twice differentiable with bounded second derivative. 
(H3) iIj{x)/x is nonincreasing for a; > 0. 
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Obviously, function satisfies assumptions (H1)-(H3). As will be seen below, 
assumption (H2) is needed to control the rest in the Taylor's expansion of 
while (H3) is used in order to apply results of Zhengyan et al. [36j on the consis- 
tency of the scale S-estimator in a model without change-point. Moreover, in the 
paper of Zhengyan et al. [36], the assumption (H3) is needed to show the conver- 
gence of the regression parameter estimator, which is not the case here. 



Theorem 3.2 Under (Al)-(A3), (H1)-(H3), ([^j, for all vr in a neighbourhood 
V(7r'^) ofn^, the estimator of ctq is strongly consistent: s„(/3i„('7r), /?2n(7r), vr) 

Corollary 3.1 Under (A1)-(A3), (H1)-(H3), (0), scale parameter S-estimator 
{Pin, f32n, 7r„) is strongly consistent for (Tq- 

Remark 3.2 In a model without change-point, the assumption (H2) is needed for 
found the convergence rate and the asymptotic distribution of the estimators but 
not in the convergence proof. 

Remark 3.3 The convergence result of Theorem \3.S\ holds if random vector Xt is 
not more Gaussian but i.i.d. with IE[Xt] = and lE[XtX'[] < oo. 

In order to find the convergence rate, we will use the Hermite expansion for a 
function of standard Gaussian variable (for details about the Hermite expansion 
see for example Palma [30]). Let us consider function %(.) := p(.) — K, where 
K = lEq,[p{ei/ao)]. Suppose that the Hermite rank of x (^) is Qi- Because function 
p is symmetric and p(0) = 0, we have qi > 2. If we denote ut = ^t/co, then: 

q>qi 

with Hg the Hermite polynomial, Jq{x) = ^[x{^i)Hq{i'i)] and for all t,t' = 1, ■ ■ ■ n: 

lE[Hp{iyt)HgM] = q\^\t - t')lp=, (13) 
Let also k = min{(agi)/2, (6*, + a)/2,l < i < d}. 

In order to have the rate of convergence of the estimators in a model without 
change-point, following assumptions are imposed by Zhengyan et al [36\: aqi < 1 
and max{a -|- 6j] 1 < j < d}. 

The following theorem gives the convergence rate of the regression parameters and 
of the scale parameter estimators. These rates are the same that in a model with- 
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out change-point. 



Theorem 3.3 For all n E (0, 1), if (Al)-(A3), (H1)-(H3), hold, we have 
||An(7r) - I3l\\ = Ojp ([mxY^L^in-K)) = Ojp {n-''L,{n)) 

||/32n(vr) - = Ojp ((n(l - n))-''L,{n{l - vr))) = Ojp {n'''L,{n)) 

where Li and Li are slowly varying functions. For the scale parameter, putting 
s„(7r) := Sn(/3in(vr),/32„(7r),7r), we have |sn(7r) - a°| = Ojp (n~''Li(n)). 

Now let us study the convergence rate of the change-point estimator: 

TTn = argniinsn (/3i„(7r), /52„(vr), vr) = argniin Sn (/3i„(7r), /32„(7r), vr) - , /J", ^r) 

For that we consider one of the last two equations of ( |T2l) . for instance (c): 



n 



-1 



t=[mr 



]+l \Sn[Pln{V,P2n{V,T^ 



7r„. - 7r° 



Theorem ^3. 4 Under assumptions (A1)-(A3), (H1)-(H3), we have 
Ojp{n~'^~''Li{n)) with Li{n) a slowly varying function. 

Example. If a > maxj=i^...^rf ^j, then k = [a + minj=i^...^rf^j)/2 < a. 



What is remarkable comparatively to the independence or the short-memory case 
is that converges faster towards ttq when Xt or et are long-range dependent. 
Consider the particular case a = 6i = ... = 9d, then k = a. Further if a G (1/2, 1), 
then, for the estimators of jSi and (52, we have a faster convergence rate than in the 
independence or short-memory case. Finally, the long-memory brings about that 
the true values of the parameters are faster approached. 

Remark also that the obtained convergence rate completely differs from that of 
change-point r-estimators when Xf are NED-dependent (Fiteni [T3], [IS]) - If are 
independent, the convergence rate is for the change-point estimator and n~^/^ 
for the parameters regression estimator, indifferently of used method: M-method 
(Koul et al. [25]), ML- method (Ciuperca and Dapzol [9j), LS-estimation (Bai and 
Perron [3]). Same convergence rate, n~^, is obtained for change-point LS-estimator 
in a model with correlated errors, but not with long-memory (Bai pQ). 
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It is interesting to note that the rate convergence of the change-point estimator 
in the mean of Gaussian variable ([2]), having long-range dependence, considered 
by Horvath and Kokoszka [20], is n'^^g^^{l/5) with g a regular varying function. 
Thus, the estimator of Horvath and Kokoszka [20] is slower than our estimator. 
On the other hand, let us remark that convergence rate of the S-estimators depends 
of the Hermite rank of p{ei/aQ) — K and of the covariance structure of Xt and St- 



4 Proofs of Theorems 



Proof of Theorem l3.1L Let us consider the function e(,^) = iE[s„(?7, vr)— s„(?7°, 7r°)], 
with supposition, without loss the generality, that vr < vr". Using the same argu- 
ments as for (1371) . we obtain that: IE\\Sn{j], vr) — Snij], 7r°)|] < C||/3i — /32|| ■ |7r — 7r°| < 
oo and similarly to flM|) : tt") — Snijf ,'T^'^)\] < C\\rj — rp\\. Thus, function 



e((^) is well-defined. By Lemma [5.31 function e(^) is continuous and furthermore 
e(^°) = 0. For using an argument like the one in Huber [21], we will to prove that: 
IE[sn{j],Ti) — Snirp^T^^)] > 0, for every ^ 7^ Since s„(^) and are both 

solutions of equation (] 
with: 



we have = {S{'l + 5};^) + {S^^i + S',:>) + {S^^' + S',^ 



(0) 



:{o) 



q(0) 



t=l 



(rm) 



'rm) 



S- 



(0) 

3,n 



n 



t=[n-K]+l 



t=[n7rO]+l L 



P 



Si 



We) 



E 



t=l L 



Then, by the mean value theorem (TVM), Sfl + S^'n + S'^'L can be written as: 



n ^ P 

t=[n7r]+l 
n 

' E 

t=[n7r0]+l 
(0) , r.(0) 



Si 1 — n 



' rm) \ 
frjm 



n 



j:rt{P> 



r*(/3?) 



t=i 



+ E rM)^ 



rM, 



+ E rm)i^ 



rM) 



with u^^\ u']^^ u'^'> defined in the same way as in the proof of the Lemma 15.31 
Moreover, using property ffTTl) . we have the following: Si[l + Sf^^- Sf^ = [sni^) — 
SniOWn, where is a positive random variable with probability close to 1. 
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Moreover, using Taylor's expansion, the expressions of S[^l, 82^^ and 5*3^^ can be 
written as: 



t=i 



n-K°] 
t=[mT]+l 



gm _ .,-1 



(1) 

3,n 



77, 



t=[n7r0]+l 

with 61, 62, G (0, 1). By the ergodic theorem, we obtain: 



r.-^ EH - W (^) = o^(l), - ^^^^ 

r^-^ Er=[„.o]+i - m (iff) = Oip(l) 

(14) 

Relation fll4p and assumption (HI) imply: for any ^ 7^ for all e > 0, there exits 
a > 0, such that 

nsl]l + Si]l + Si]l>a]>l-e (15) 

Assumption ([7]), the above relation and 5*1^^ + S'£^ + S^]l = — (5'i°2 + S2^l + S's^^) = 
[sn(0 - 'Sn(C°)]K, with K > 0, imply the conclusion iE[s„(r7, vr) - s„(r7°, 7r°)] > 
0, for all ^ 7^ Using this, the compactness of the parameter space, ^„ = 
argminggXxTx[o,i] ■Sn(0 an argument like one in Ruber ^21j, the strongly con- 
vergence of results. ■ 



Proof of Theorem 13. 2i We first prove that, if we consider in ([T]) the true value 
for 1] and vr, then the scale parameter estimator is strongly consistent: 



s„(r/",7r") ^ ao (16) 

n—i-oo 

Let us observe that in fact Snir]^, vr*^) is the solution of a problem without breaking: 



and then, relation (|T6|) is obtained by Theorem 3.1 of Zhengyan et al. [36]. Now, 
as a consequence of Theorem 13.11 we may consider only the case {1], it) in a neigh- 
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bourhood V(?7 , vr ) of {rj ). Consider the decomposition: 

s„(r7,7r)-s„(?7°,7r°) = [s„(?7, vr) -5^(77°, 7r)] + [s„(?7°, 7r)-s„ (77°, 7r°)] := Si{n) + S2{n) 

(17) 

Since Si{n), depends only on the regression parameters, by Theorem 13.11 taking 
into account relations ( ITOi) and ( l32i) . we readily obtain: 

sup |s„(?7,7r) - s„(r7°,7r)| ^ (18) 

For S2{n), an argument like the one used for (13^ yield that Snijf, vr) — Snijf , vr°) 
behaves as: 



n 



Sn(?7°,7r) 



t=[n7r]+l 

where rt{Pi,P2) — ''^t{Pi)+^t[i^t{P2)~^tiPi)]j with < < 1. Let us remark that 
ft{,Pi,P2) = ^t- We write Taylor's expansion of ip {et/ Sn{ri^ ,t!')) around ipi^t/^^o) 
up to second order: 



t=i 



n 



TT 



t=i 



^^i gn(?/°,7r) - o-Q y^^^^^^/ 



2o-os„(?7",7r) ^ 

with Q = et['Sn(?7°,7r) + i^t(ao - s„(r7°, 7r))]/(o-os„(r7°, vr)), G (0,1). Since ip" is 
bounded, we have ^2^=1 ^ '^"{^t)£t-^t < 00 with probability 1. Moreover: 



vr 



t=i 



El 



a.s 







Hence: n ^ El=f ^ i^t/sniv°, tt)) - ^/^ (^t/fTo)] = ojp(s„(?7°, vr) - ctq)- This rela- 
tion and Yl}t=i ^ Xfip (et/cro) yield that S2{n) = Snirj^, tc) — s„(?7°, 7r°) = 
ojp(l) + 0]p{sn{jf , vr) — (To) = Oip(l) + ojp(S'2(n)), for the last relation we have used 



16]). Then sup^gy(^o) \S2{n 



a.s. 

71— >00 



0. This fact, with relation ( |T8i) . together with 



decomposition (|T7j) and relation (ITUI) . yield the Theorem. 



Proof of Theorem l3.3L For tt G (0, 1) fixed, the convergence rate of the regression 
parameters estimator /5i„(7r) and /92n(7r) is obtained by the application of Zhengyan 
et al. [36] results on every segment. On the other hand, the study of the convergence 
rate of s„ is more difficult because it interferes in both segments. For notational 
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simplicity, in the rest of this proof, we denote (3in = /3in(7r), f32n = /?2n(7r) and 
Sn = Sni'n')- The study will be made in two stages. First, we are going to write 
equation (|T^ (a) in another form, putting in evidence cxo by a limited development. 
Afterwards, in the second stage, the obtained form is studied by taking into account 
the convergence rate of the regression parameters estimators and what Xt, St are 
long-memory Gaussian. 

Stage 1. Equation f|T2l) (a) can be expressed as: 



n 2^X\— ]+n 2^ X\— 



t=i 



We apply ([8]) to function x and for: 

t = 1, ■ ■ ■ , [nvr], Xt = rt0in)/sn, ht = ((Tq"^ - 5^^) 

t = [nir] + 1, ■ ■ ■ ,n,xt = rt02n)/sn, K = (o-q"^ - s"^) rt02n) 
Hence, for the part t = 1, ■ ■ ■ , [nvr], we have: 



n ^ J2 Xi^t + ht) = n ^T,x{xt) + 



t=i 



t=i 



O-QSn 



n ^J2rt{Pin)i^{xt) 



t=i 



+ n 



-1 Sn— Q-Q 



t=l 



(19) 



E rK/3in) /o^(i - sW + ^ ■ n(/?i„) - £) 



(20) 

Thus, in order to study the first sum of ffT^ . we shall analyse the terms of the 
right-hand side of (1201) . 

We first consider the last term of the right-hand side of ( !20l) . Elementary algebra 
yields that: 



n 



-1 



[riTr] 



n 



-1 



t=i 



t=i 



t=i 



By the ergodic theorem n ^ ^t = Ojp(l), n ^ X'^Xt = Ojp(l) and since 
Et and Xt are independent, we have n~^J2^^i^tXt = ojp(l). Thus, since tjj' is 
bounded, s„ — ctq = Ojp(l), 5„ > with probability 1, the last term of the right- 
hand side of (PUI) is Ojp(l). 

We now consider the second term of the right-hand side of ( l20l) . For the sum, we 
have: 



n 



J2 rt{f3in)'ip 



In I 



Vt(/??) 



t=i 
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< n 



[nvr] 



t=i 



In 



-n 



t=i 



and since is bounded: 



[n7r] 



<C\\^n-(3nn-'T.\\Xt\\+Cn-'Y: 



t=i 



t=i 



The above inequality, with the ergodic theorem, < oo, iE'[||£t||] < oo and 

Pin - Pi = Oip{l), imply that 



n 



t=i 



t=l 



Thus, the second term of the right-hand side of (pn|) can be expressed 



t=i 



Then, relation (|20l) becomes: 



[nvr] 



[rtTr] 



[nvr] 



n 



t=i 



t=i 



A similar relation holds for the part t = [nvr] + 1, ■ ■ ■ , n: 



n 



Sn — <^0 I -1 



n 



t=[n7r]+l 



t=[n7r]+l 



t=[n7r]+l 



(22) 



Adding (1211) and (l22ll . taking into account the relation (jT9l) . we obtain: 



= n-iEx 



In; 



O"0 



t=[mT]+l 



O"0 



n-^E^*^ (-)+Ojp(l) 



By ergodic theorem: n ^ Y.7=i i^t/cro) JE [eiip {si/ao)]. 

Stage 2. Then, the convergence rate of Sn will be obtained by studying: 



n 



In 



t=l 



+ ^ ^ E X 

t=[n-K]+l 



2n 



IE 



Eli) ( — 

O"0 



+ Ojp(l) 

(23) 
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For t = 1, ■ ■ ■ , [mr], making the Taylor's expansion of x up to second order, we 
obtain that J2t=i X (o^cT^^il/^in)) can be written as: 



n 



, (^t 



t=l 



2a, 



t=\ 



(24) 



Let us analyse the three terms of the previous equation separately. 

• For the first term, let us Vt = e^/cJo ~ A/'(0, 1) denote. We use the Hermite 

expansion for xi'^t)- Because the Hermite rank of xi'^t) is Qi, Qi > 2, by ffT5|) 

below: 

M T / \ [nw] [mr] j / x 

Ex(^t) = ^E^..(^*) + E E ^i^,(^.):-T,, + T,, (25) 

t=i yi- t=i t=i g>qi+i y- 



For Ti^n we have: 

j2(x)SS 



EE(^i07^^(Ml) = (?i 



(91 



[nvr] — 1 



[nn] -— 1 

M7^Ho) + 2 E (M-^)7^'W 



0(n) + 2 E ([H - t)r'"''L'''{t) 



t=i 



Jlip) r 



0(n) + 0(n2-"«^)L«i([n7r]) 



0(n2-"''i)L''i([n7r]) 



For T2 „ we have: 



mn]= E 



[■n-Tr] [nvr] 

EE7^(I^-j1) 

t=l 3=1 



< E ^EE7-+^(|t-j|) 



= 0(n)+2 E E ([H-^)7'"^'(^) < 0{n)+2 E E (M-^)^"^'"^'^"^''^' W 

g>gi+i ^' t=i g>gi+i 9- t=i 

= 0(n2-(^i+i)"L''i+i([n7r])) 

Hence iE[T2^„] = o(iE[Tf„]). Then, for equation ( |25l) . we straightforwardly have: 



E x(^0 = OiP (iK[T^j)'^' = 0^(n^--''^/2)L''^/^(H) 



(26) 



t=i 



For the second term of fl24p . since Vt and are independent, by ergodic theorem. 
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we have: 



n 



(27) 



t=l 



For the third term of since ip' is bounded and n ^ XtXf = Ojp(l), we 
have: 



n 



' E X" (^t) [Xt0in - = Ojpmn - /3?f ) = 0^(11 A. - (28) 



t=i 



Then, by taking <^U^, (I2HD into account, the behaviour of iHM is given by 
and it is 0]p{n-''''^/'^)L'^^/\[mi]) + ojp( - /?°||). Similar one reasoning is made 
for the part t = [nn] + 1, • • ■ , n and we obtain that: Ylt=[mT]+i X {yo^^t02n)^ = 
Ojp(n-"'?i/2)^gi/2(^(^ _ ^ oip{\\(32n - PU)- Then, for relation we have: 



<^0 ~ Sn 



Slip { — 
O"0 



Oip(l) 



and the convergence rate of s„ follows. 



Proof of Theorem 13. 4i As a consequence of Theorem 13.1^ we consider tt in a 
neighbourhood of 7r°. We suppose, without loss of generality, that vr < vr^. Consid- 
ering relation (fT2l) (c). we have: 

t=[n7r]+l (/?ln(7r),/32n(7r),7rjy t=[n7rO]+l V (An (tt) , /32n (tt) , TT j 

(29) 

Since ||/32n(7r) — /^^ || = Ojpiji ''Li{n)), an argument like the one used for relation 
([27]) yield that the right-hand side of ([29]) is Oip{n-''Li{n)). 

We apply ([HD to function ^, for: = , , /i* = f^^^fil^','^ n - For 

^ ^' s„(/3i„(7r),/32„(7r),7r) ' s„(/3i„(7r),/32„(7r),7r) 



the left-hand side of (!29l) . since s^i (/3in(7r), /32n(7r), vrj ^ ao a.s. for n ^ oo, and 
Pi 1^ obtain : 



n 



(30) 

But, making Hermite expansion of ipii^t), we get: 



t=[n7r]+l ^^0^ ^0 t=[mT]+l t=[mT]+l q>l 
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where: Jq{ip) = Eliplui) Hglui)]. On the other hand, as in the proof of Theorem 
\'S.'S\ we have /2,n = ojp(/i,„). The variance of Ji „ is: 

t2/„/,N [n(7rO-7r)] [n(7r"-7r)] 
[ri(7rO-7r)] 

= [ri(7r° - 7r)]7(0)r(0) + 2 E - n) - i]-f{i)T{i) 

i=l 

= O (Lini-R^ - 7r))/:^(n(7r° - 7r))M(n(7r° - 7r))/:(n(7r° - tt))) 
What imphes: 

E ^ f— ) ^* = Op ((n(7r° - 7r))-"^'"(''»+")/2^i/2(n(7r° - 7r))/:^(n(7r° - 7r))£(n(7r° - tt)) 

This last relation together with fl2^ . flHUl) and since the right-hand side of ([221) is 
Oip{n~^Li{n)) imply: Oip{n~^Li{n)) 

= Ojp(n(7r°-7r))+Ojp ((n(7r° - 7r))-"^*°(^»+")/2i:i/2(^(7r° - 7r))/:^(n(7r° - 7r))£(n(7r° - tt))) 
We obtain that: 7r„ — 7r° = 0]p{n^^^''Li{n)). ■ 



5 Lemmas 



Lemma 5.1 If solution Sn{0 of equation 0) exists, then it is well-defined, bounded, 
strictly positive, with a probability arbitrarily large. 

Proof of LemmaEm Since ]E[rt{p) = 0] and Var[rt{P)] = Var[et]+pVar[X]P^ < 
00, by Bienayme-Tchebichev inequality, we obtain that rt{(3) is bounded with a 
probability arbitrarily large. 

We prove that s„(^) is bounded by reduction to absurdity. If is not bounded 
then: there exists ^ G T x T x (0, 1) and G N such that for all n > n^, M > 0, 
exists e > such that: _F'[s„(^) > M] > 1 — e. Since p is continuous and p(0) = 0, 
then: 



^ 0, t=l,...,n (31) 



+00 



17 



and 




[ravr] (rtiPi)\ , n - [nn] n^h 
< max p — — - H max p 



n l<i<[n7r] Y j n [n7r]+l<t<n \^ ) 

which, by (1311) . converges to in probabihty, for n oo. What is contradictory 
with (jl]). To prove that s„(^) > 0, let us consider function g{P,s) = {e — XP)/s, 
with /5 in a compact of containing and s G (0, oo). Since e — X(3 is bounded 
with a probabihty close to 1, if Sn{C) = 0, thus \vais^o\g{,(^, s)\ = oo, what 
is contradictory with (jl]). Hence, for all e > 0, there exists 6 > such that 

xTx[0,l] 



Lemma 5.2 Under assumptions (Al)-(A3), for any e G (0, 1), ,^ G T x T x [0, 1], 
there exists a positive constant 5 such that: ]P\ inf DniO > > 1 — e. 



Proof of Lemma 15. 2i Because ^ belongs to a compact and taking into account 
relation ffTTl) . we have to prove that for all e > 0, G T x T x [0, 1], there exists a 
6 > such that: 



n 



[mi\ 



t=i 



> 5 



> 1 



Since rt{(3), ip (7^^) have the same sign and since ijj is continuous, we are going 
to show only that, for all e > 0, for all P in compact set T, there exists a > 
such that: r[\e - X/3\ > 5i] > 1 - e. 

Random variables e and X are Gaussian and independent. Then: lP[\e ~ X(3\ > 



Si 



2P[6 - X/3 < -61] 



2$ 



. We recall that ^ denotes 



[7(0)+/3r(0)/3i^]i/2^ 

the standard Gaussian distribution. Then, the Lemma results by setting: 61 



Tll/2 



The key for strong convergence proof is the following uniform convergence result. 

Lemma 5.3 For all g > 0, under assumptions (A1)-(A3), for 
VtgiX) = G T X T X [0, 1]; \\r] - r]*\\ < ^, Itt - 7r*| < q], we have: 



IE 



sup \sn{ri,'K) - Sn{ri* 



>0 
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Proof of Lemma 15. 3i We have the triangular inequahty: 

|s„(r7,7r) - s„(r/*,7r*)| < |s„(r7,7r) - s„(?7,7r*)| + |s„(77,7r*) - s„(r/*, 7r*)|. First, we 
will study Sn{ri, IT*) — Snirj*,^*). By the mean value theorem (TVM), we have: 

Sniv, vr*) - Sniv*, vr*) = (A - vr*) + - /^Dtt^^ /?2, vr*) (32) 

where A = A + - (31), (32 = (32 + V2{(32 - (32), vi, V2 G (0, 1). By Lemma 



I5.2[ applying Cauchy-Schwarz inequahty in fjTOj) and taking into account that ip is 
bounded, we obtain: 



dsM,P*2,^*) 



[nvr] / 

<Cn-'J2iE[X^]E 
t=i \ 



n{h) 



1/2 



<C (33) 



^s„(/3i,/3|,7r*)^ 

Then, writing a similar relation for ((9s„/9/52)(/3i, /52, vr*), we have for (!32|) : 

iK[|s„(r/,7r*)-s„(r/*,7r*)|] -^0, for ^ (34) 



Let us remark that if vr* = or vr* = 1, then in relation (|32|) . the term in 
respectively (32, does not appear. 

Now, we study |sn(?7, vr) — s„(r7,7r*)|, supposing that vr < tt*. Since s„(?7,7r) and 
s„(?7,7r*) are both solutions of (jl]), we have: 



[n-K \ 

t=i 



' r,(A) 

,Sn(^,7^) 



Sn(^,7r*) 



t=[n7r*]+l 



• n{fi2) 



' T02) 



[n7r]+l 

Thus, applying the MVT: 



' n{(3,) 

.s„(r/,vr) 



• niP2) 

.Sn(^,vr) 



= n-'EK,|.f,(A-A; 



where u^^\ u^^-* are two positive bounded functions, not necessarily solutions of 
and rt{(3i, /^s) = rt{(3i) + mt[rt{(32) - rt{(3i)], with < < 1. By relation ([II]): 



(35) 



E rt{(3i)^ 



t=i 



' r,(A) 

,Un\r], IT, TT* 



E 



t=[n7r*]+l 



Un\ri, vr, vr*) 



> 



(36) 
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with a probability close to 1. On the other hand: rt{Pi, P2) =Yt — Xt[l3i + mt{P2 — 
Pi)] = ^tiPi + ^t{P2 — Pi))- Using the same arguments as for flHHl) . we obtain that: 

1/2 

< Ci{n*-7i) 

where Ci is a vector with all bounded components. Taking into account also fl36|) . 
we obtain for flH31) : 

E[\Sniv,T^)-Sn{V,^*)\]<C\\Pl-P2\\ ' |vr - VT* | < C^— >0 (37) 

Relations ( |34l) and ( 1371) imply the Lemma. ■ 



n 



2 / n{Pi,p2) 
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